Maximum likelihood implementation of an isolation-with-migration model with three species for testing speciation with gene flow.
نویسندگان
چکیده
We implement an isolation with migration model for three species, with migration occurring between two closely related species while an out-group species is used to provide further information concerning gene trees and model parameters. The model is implemented in the likelihood framework for analyzing multilocus genomic sequence alignments, with one sequence sampled from each of the three species. The prior distribution of gene tree topology and branch lengths at every locus is calculated using a Markov chain characterization of the genealogical process of coalescent and migration, which integrates over the histories of migration events analytically. The likelihood function is calculated by integrating over branch lengths in the gene trees (coalescent times) numerically. We analyze the model to study the gene tree-species tree mismatch probability and the time to the most recent common ancestor at a locus. The model is used to construct a likelihood ratio test (LRT) of speciation with gene flow. We conduct computer simulations to evaluate the LRT and found that the test is in general conservative, with the false positive rate well below the significance level. For the test to have substantial power, hundreds of loci are needed. Application of the test to a human-chimpanzee-gorilla genomic data set suggests gene flow around the time of speciation of the human and the chimpanzee.
منابع مشابه
The Generalised Isolation-With-Migration Model: a Maximum-Likelihood Implementation for Multilocus Data Sets
Statistical inference about the speciation process has often been based on the isolation-with-migration (IM) model, especially when the research aim is to learn about the presence or absence of gene flow during divergence. The generalised IM model introduced in this paper extends both the standard two-population IM model and the isolation-with-initial-migration (IIM) model, and encompasses both...
متن کاملMaximum Likelihood Implementation of an Isolation-with-Migration Model for Three Species.
We develop a maximum likelihood (ML) method for estimating migration rates between species using genomic sequence data. A species tree is used to accommodate the phylogenetic relationships among three species, allowing for migration between the two sister species, while the third species is used as an out-group. A Markov chain characterization of the genealogical process of coalescence and migr...
متن کاملInference of Gene Flow in the Process of Speciation: An Efficient Maximum-Likelihood Method for the Isolation-with-Initial-Migration Model
The isolation-with-migration (IM) model is commonly used to make inferences about gene flow during speciation, using polymorphism data. However, it has been reported that the parameter estimates obtained by fitting the IM model are very sensitive to the model's assumptions-including the assumption of constant gene flow until the present. This article is concerned with the isolation-with-initial...
متن کاملComparative Species Divergence across Eight Triplets of Spiny Lizards (Sceloporus) Using Genomic Sequence Data
Species divergence is typically thought to occur in the absence of gene flow, but many empirical studies are discovering that gene flow may be more pervasive during species formation. Although many examples of divergence with gene flow have been identified, few clades have been investigated in a comparative manner, and fewer have been studied using genome-wide sequence data. We contrast species...
متن کاملEfficient Maximum-Likelihood Inference For TheIsolation-With-Initial-Migration Model WithPotentially Asymmetric Gene Flow
The isolation-with-migration (IM) model is a common tool to make inferences about the presence of gene flow during speciation, using polymorphism data. However, Becquet and Przeworski (2009) report that the parameter estimates obtained by fitting the IM model are very sensitive to the model’s assumptions – including the assumption of constant gene flow until the present. This paper is concerned...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Molecular biology and evolution
دوره 29 10 شماره
صفحات -
تاریخ انتشار 2012